PRINT READY DOCUMENT MANAGEMENT FOR DISTRIBUTED PRINTING 

FIELD OF THE INVENTION 

[001] The present invention is generally related to print ready document 
management, distribution and printing. More particularly, the present invention is 
related to publishing print ready document libraries using a "publish and 
subscribe" relationship, and is also related to synchronizing subscription 
information with remote subscribers by multicasting print ready documents over a 
data network. 

BACKGROUND 

[002] As networks and systems become more integrated and more 
complex, the amount and speed of information flow between users creates a 
need for more versatile and more efficient control over the information flow 
process. For print service providers with multiple production sites, the issue is 
how to best leverage and utilize technology to take advantage of the multiple 
sites. 

[003] The following Xerox Corporation U.S. patents include examples of 
systems indicating multicast communications utilizing a network, server and 
including rendering equipment (e.g., printers) usually having shared remote user 
terminals: U.S. Pat. Nos.: 6,557,111, 5,513,126 and 5,305,311. Protocols 
defining integrated system behavior for devices such as printers, scanners, 
workstations and facsimiles, are well known. These protocols define how the 
systems should integrate across networks. Operational transparency across 
networks and device platforms, provide users with an increasingly integrated and 
transparent system environment. In this environment the manipulation of 
information (such as documents) is transparent to users as a result of the various 
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network protocols that define the manner in which devices manipulate 
information. 

[004] Printers (and printer controllers or servers) are also sometimes 
referred to as "shared resources" in a networked environment. The server 
typically functions as a "spooler" to buffer the jobs that are sent to it, as well as a 
page description language (PDL) "decomposer", for converting the PDL files (e.g., 
"Interpress™ " or "PostScript®") to bitmapped files for application to the printer. 
Also of interest is Pennant Systems Co., Print Services Facility/2 (PFS/2), server- 
based software intended to act as a universal translator between various network 
protocols (LAN, TCP/IP and SNA) and printer document protocols (PCL and 
Postcript). Extensible Markup Language (XML) is also a familiar document 
transmission and conversion protocol. 

[005] Different workstations can access print services in different ways. 
Workstation selectable print options can include the number of copies, selected 
pages to be printed, paper size, image orientation, a choice of printers, and 
phone numbers when sending to a facsimile device. Already print-formatted or 
master documents may be transmitted to the printer directly, eliminating the need 
to repeat the conversion or decomposition process if another copy of the 
document is desired. If saved, a master file can be stored at the file service or 
even mailed to one or more individuals via an electronic mail service. Utilizing an 
internetwork routing service, users can transmit a document through a network 
and then across an internet, typically, via telephone lines, twisted pair wires, 
coaxial cables, microwaves, infrared, and/or other data links, allowing documents 
created in one location to be automatically routed to a print service and printer 
hundreds or even thousands of miles away, in seconds or minutes. 

[006] Having multiple sites brings up issues such as fail-over, 
redundancy/backup, and load balancing. Each location may have specific 
expertise in preparing jobs, proofing jobs, and producing jobs, but where the print 

Page 2 of 12 
Attorney Docket No. D/A2188 



service provider decides to master or print a job may depend on multi-site load 
balancing, the expertise required, and the shipping and labor costs. For these 
reasons, it is desirable to have the ability to master and print at multiple sites at 
any given time. In such cases, the print service provider would likely want to have 
specific jobs (not necessarily all jobs) ready to print at multiple locations. 

[007] Due to the steps involved in preparing and proofing a job, prepared 
print ready files are usually what are needed at each site. Although present 
systems may enable the management of source PDLs, storing the PDL induces 
additional point of failure because the ripping/processing of the PDL can produce 
different results depending upon several factors. 

[008] Manually moving print ready files from site to site, whether on 
media or over a network, becomes prohibiting when the amount of data and 
frequency increases. Centrally storing the print ready files is not an option due to: 
the speed at which it needs to be retrieved to run at rated printer speed; the 
amount of network traffic it would cause; the general lack of redundancy/backup. 
Furthermore, traditional methods of synchronizing document libraries are 
intrusive; services must be shut down while the data is being uploaded. They are 
also non-scalable, and do not occur in real-time. 

[009] Currently, Xerox Corporation's Custom Print Solution (CPS) uses 
an open-source product called "rsync" The product can replicate an entire library 
or update the deltas. The problem with rsync is that it has to be scheduled to run 
at a certain time and while it is running, CPS's Repository services have to be 
disabled. The time to run rsync can be in the vicinity of two hours to synchronize 
two print sites. Currently this is done between shifts. As the libraries grow and as 
an enterprise scales from two to multiple sites, the time to resynchronize libraries 
will become a problem. Synchronization processes can quickly consume an 
entire workshift or more. Archive tapes are also known to be used, but can pose 
even more problems for the enterprise. 
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[010] What is needed are improved systems and methods for manage 
distributed multi-site Print Ready Document libraries utilizing available data 
communications networks. 



Page 4 of 12 
Attorney Docket No. D/A2188 



SUMMARY OF THE INVENTION 



[01 1] The present invention includes a method of synchronizing multi-site 
Print Ready Document Libraries using a "publish and subscribe" relationship, and 
methods of communicating over a multicast enabled messaging system. 

[012] In accordance with features of the present invention, the use of a 
multicast communication transport layer avoids the network bottleneck, which 
would otherwise occur when transmitting large amounts of data in a point-to-point 
fashion. Furthermore, use of "publish and subscribe" allows for jobs (or 
documents) to be assigned to specific topics or subjects. 

[013] In accordance with an aspect of the present invention, subjects are 
predefined and each site which is interested in a subject expresses the interested 
through subscribing to the subject. Any site that publishes to the subject is 
assured that all sites that have presubscribed will receive the data. This allows a 
site to share only the necessary data to the necessary sites. 

[014] It is a feature of the present invention that the systems and 
processes enable sharing of specific print ready documents with other specific 
sites in an efficient manner such that the organization as a whole has the 
competitive advantage of being able to print the job in multiple sites. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



[015] Figure 1, is a block diagram of a system in accordance with a 
preferred embodiment of the present invention. 

[016] Figure 2, is a flow diagram of a method in accordance with a 
preferred embodiment of the present invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 



[017] Referring to FIG. 1, a block diagram of a system 100 for enabling 
the multicast publication, subscription and synchronization of data is shown. The 
system 100 includes access to a data network 150, at least one server 110 
providing access to topical information stored in at least one database 120 via 
enterprise communication equipment 130. The system communicates with more 
than one client 140 located at remote locations and also having access to a data 
network 150. The communication equipment 130 will include such items as a 
router 133 and a network access device 135 (e.g., modems) to provide access to 
the network 150. Publish-subscribe middleware 160 enables the at least one 
server 110 and at least one database 120 to operate together for the 
management of subscriber data, topic data and publication-subscription logs. 

[018] Referring to FIG. 2, a flow diagram 200 of a method in accordance 
with a preferred embodiment of the invention is illustrated. A method of 
synchronizing multi-site print ready document libraries begins at block 210. 
Referring to block 220, document library subject availability is published via 
multicast communication over a data network. Referring to block 230, 
subscriptions for document library subjects are received over the data network 
from remote subscribers at individual sites. At block 240, data representative of 
the document is periodically synchronized over the data network with remote 
subscribers located at individual sites. The process ends at block 250. 

[019] Jobs can be assigned to subjects and can also represent 
documents for publication that are associated with the topics. It should be 
appreciated that remote subscribers at individual sites interested in a subject can 
subscribe to the subject and receive document updates automatically, wherein 
only necessary data related to the subject is necessarily synchronized with the 
remote subscribers located at specific sites. Communication is all accomplished 
via multicast to subscribers by publisher enterprise equipment. 
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[020] Use of a multicast communication transport layer avoids the 
network bottleneck, which would otherwise occur when transmitting large 
amounts of data in a point-to-point fashion. The use of "publish and subscribe" 
allows for jobs (or documents) to be assigned to specific topics or subject. 
Subjects are predefined, and each site that is interested in a subject expresses 
the interest through subscribing to the subject. This is accomplished through a 
configuration file that acts as a lookup table for the subjects that a given site is 
interested in receiving. When the software is initialized at a given site, it 
communicates what subjects it is interested in based on the information in its 
own configuration file. 

[021] The publish-subscribe middleware 160 keeps track of what sites 
are subscribed to what subject. Any site that publishes to a given subject is 
assured that all sites which have pre-subscribed to the subject will receive the 
data. This allows a site to share only the necessary data to the necessary sites. 
Network traffic is thus minimized so that only interested parties receive data and 
the data is only transmitted across a given path once in a one-to-many multicast 
fashion. 

[022] As an example of how the invention worked, XYZ is the print 
service provider. There are four print service sites across the country, which are 
referred to as DTC1-DTC4. The Catholic Church is one customer and bears a 
name associated with its own subject - CATHOLIC. The US Department of 
Defense is another customer and has two subjects DODARMY and DODNAVY. 
In this example DTC and DTC4 subscribe to CATHOLIC. DTC1 and DTC2 
subscribe to DODARMY. DTC2 and DTC3 subscribe to DODNAVY. Note that 
the DOD subjects are named such that the wildcard "DOD*" will be published to 
any DODNAVY or DODARMY site. 



Page 8 of 12 
Attorney Docket No. D/A2188 



[023] The following is a summary of which subscriber will be allowed to 
perform work for a given subject: 

Subject Valid Sites 



CATHOLIC 



DOD* 



DTC1, DTC4 

DTC1, DTC2, DTC3 (where "*" is a wildcard) 



DODARMY 



DODNAVY 



DTC1, DTC2 
DTC2, DTC3 



[024] It will be appreciated that various of the above-disclosed and other 
features and functions, or alternatives thereof, may be desirably combined into 
many other different systems or applications. Also that various presently 
unforeseen or unanticipated alternatives, modifications, variations or 
improvements therein may be subsequently made by those skilled in the art 
which are also intended to be encompassed by the following claims. 
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